METHODS AND COMPOSITIONS FOR DIAGNOSIS AND TREATMENT OF B CELL
CHRONIC LYMPHOCYTIC LEUKEMIA

Cross-Reference to Related Application 

This application claims the benefit of U.S. Provisional Application No. 60/509,473, filed 
Oct. 8, 2003. 



Statement Regarding Federally Funded Research or Development

The U.S. Government has a paid-up license in this invention and the right in limited 
circumstances to require the patent owner to license others on reasonable terms as provided by the 
terms of Grants No. CA 81554 and CA 87956 awarded by the National Institutes of Health. 



Background

The present invention generally relates to methods of diagnosis and treatment of B cell 
chronic lymphocytic leukemia (B-CLL). More particularly, the invention relates to methods of B- 
CLL diagnosis and treatment based on the presence of sets of B-CLL patients that have B cell 
receptor genes in common. 

B cell chronic lymphocytic leukemia (B-CLL) is an accumulative disease of slowly
proliferating CD5+ B lymphocytes that develops in the aging population. Whereas some patients
with B-CLL have an indolent course and die after many years from unrelated causes, others
progress very rapidly and succumb within a few years to this currently incurable leukemia.
Over the past decade, studies of the structure and function of the B cell antigen receptor (BCR)
used by these leukemic cells have helped redefine the nature of this disease.

CD5+ B lymphocytes in B-CLL patients express low levels of surface membrane Ig that
serves as their receptor for antigen (BCR). The genetics of this Ig have clinical relevance, as
patients with an Ig that is unmutated in the variable (V) regions have a significantly worse
outcome than those with significant numbers of mutations in the Ig V region. The biological
basis by which the Ig molecule/BCR associates with these distinct outcomes is unclear.

There are several lines of evidence supporting a role for the Ig molecule in the evolution
of B-CLL. Analysis of V region gene cassette usage has provided inferential evidence that the Ig
molecules on B-CLL cells are not the product of random chance. The distribution of variable
region gene cassettes used by B-CLL clones (Schroeder and Dighiero, 1994) differs from that
found in normal cells (Brezinschek et al., 1997), with an increased frequency of VH 3-07, VH 4-34,
and VH 1-69 genes (Fais et al., 1998). Furthermore, the distribution of mutations among B-CLL
cases using these specific VH genes is selectively and strikingly biased. For instance, the VH
genes of ~40% of B-CLL cases contain <2% differences from the most similar germline gene and
~25% are identical to a germline VH counterpart. However, 80% of the cases that use a VH 1-69
gene are germline and ~90% of these have less than 2% mutation. Conversely, in 93% of cases the VH
3-07 gene exhibits significant numbers of mutations (≥2% difference from the germline gene).
These deviations from randomness in gene use and acquisition of somatic mutations imply that
the structure of the antibody molecule, and possibly its antigen specificity thus manifest, played a
role in the leukemic transformation of particular B cells.
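
The 2% criterion used above to separate mutated from unmutated V genes can be sketched as a short calculation. This is a minimal illustration only: the function names are hypothetical, and the sketch assumes the rearranged and germline sequences have already been aligned to equal length, which the text does not describe.

```python
def percent_difference(rearranged: str, germline: str) -> float:
    """Return the percentage of aligned positions at which two sequences differ.

    Assumption: both sequences are pre-aligned and of equal length.
    """
    if len(rearranged) != len(germline):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not germline:
        raise ValueError("sequences must not be empty")
    mismatches = sum(
        1 for a, b in zip(rearranged.upper(), germline.upper()) if a != b
    )
    return 100.0 * mismatches / len(germline)


def mutation_status(rearranged: str, germline: str, threshold: float = 2.0) -> str:
    """Classify a V gene as 'mutated' (>= threshold % difference) or 'unmutated'."""
    difference = percent_difference(rearranged, germline)
    return "mutated" if difference >= threshold else "unmutated"
```

With a 100-nucleotide toy germline, a single mismatch (1%) would be classified as unmutated and two mismatches (2%) as mutated.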

More recently, sets of B-CLL cases with highly similar Ig molecules have been identified.
Our laboratory identified five unmutated IgG-expressing B-CLL cases in which the BCR was
remarkably similar in structure (Ghiotto et al., 2003). These Ig molecules used the same VH, D,
JH, and in all but one instance the same Vκ-Jκ. Furthermore, the HCDR3s were highly similar in
sequence and the LCDR3s were virtually identical, with a Vκ-Jκ junction that contained an invariant,
non-templated arginine codon. A larger set of patients expressing a VH3-21/JH3 H chain and a
Vλ3h/Jλ3 L chain has been described by Tobin et al. (2003). These cases also have a HCDR3
that is small and of very similar sequence. The VH3-21 gene is not found at high frequency
outside of northern Europe, suggesting an environmental or genetic influence. The patients from
both of these groups have a poor clinical course that does not necessarily relate to their VH
mutation status.

Functional studies have shown that B-CLL cells from patients with unmutated Ig V regions can
transduce signals through the B cell receptor (BCR), while cells bearing a mutated BCR cannot.
This finding could have major significance since it provides a means by which antigen binding to
the BCR might affect the biology of the leukemic cells in vivo. This is especially relevant since
many B-CLL cases synthesize autoreactive Ig/BCR molecules (Broker et al., 1988; Borche et al., 1990;
Sthoeger et al., 1993) and/or use VH genes that are often found in autoantibodies (Fais et al.,
1998). This is consistent with the derivation of the leukemic cells from CD5+ B cells that in
normal individuals are considered the primary source of natural antibodies (Casali and Schettino,
1996).

Despite recent identification of several biomarkers associated with outcome in B-CLL, 
there is a need for additional prognostic indicators for this disease. Also, there is a long-standing 
need for therapeutic targets and new therapeutic modalities in B-CLL, for which there is no 
generally accepted and specific curative regimen. The present invention addresses these needs. 

Summary of the Invention 

Accordingly, the inventors have discovered that the B-CLL cells of a significant
proportion of B-CLL patients with an aggressive form of the disease share the same classes of VH,
D, JH, VL, and JL antibody genes as other B-CLL patients, forming "sets" of B-CLL patients with
highly homologous B cell receptors. This discovery makes practical various therapeutic and
diagnostic methods.

Thus, in some embodiments, the invention is directed to isolated and purified preparations
of a combination of a light chain antibody gene and a heavy chain antibody gene. In these
preparations, the family members of the light chain antibody gene and the heavy chain antibody
gene are selected from the group consisting of
VH4-39/D6-13/JH5/VLκO12/2/JLκ1/κ2 (Set I),
VH4-34/D5-5/JH6/VLκA17/JLκ1/κ2 (Set II),
VH3-21/JH6/VLλ3h/JLλ3 (Set III),
VH1-69/D3-16/JH3/VLκA27/JLκ1/κ4 (Set IV),
VH1-69/D3-10/JH6/VLλ1c/JLλ1 (Set V),
VH1-02/D6-19/JH4/VLκO12/2/JLκ1/κ2 (Set VIa),
VH1-03/D6-19/JH4/VLκO12/2/JLκ1/κ2 (Set VIb),
VH1-18/D6-19/JH4/VLκO12/2/JLκ1 (Set VIc),
VH1-46/D6-19/JH4 (Set VId),
VH5-51/D6-19/JH4/VLκO12/2/JLκ2 (Set VIe),
VH1-69/D3-3/JH4/VLκA19/JLκ4 (Set VII), and
VH1-69/D2-2/JH6/VLκL6/2/JLκ3 (Set VIII).

The invention is also directed to cells in culture comprising at least one vector comprising
antibody genes from Set I, Set II, Set III, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId,
Set VIe, Set VII, or Set VIII.

In other embodiments, the invention is directed to isolated and purified antibodies
encoded by antibody genes from Set I, Set II, Set III, Set IV, Set V, Set VIa, Set VIb, Set VIc,
Set VId, Set VIe, Set VII, or Set VIII.

In further embodiments, the invention is directed to anti-idiotype antibodies that bind to
the antigen-binding region of an antibody encoded by antibody genes from Set I, Set II, Set III,
Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII.

The invention is additionally directed to hybridomas expressing any of the above-described
antibodies.

In related embodiments, the invention is directed to bispecific antibodies comprising the 
binding site of the above-described anti-idiotype antibodies and a binding site that binds to 
another B-cell antigen. 

The present invention is additionally directed to peptide antigens that bind to the
antigen-binding region of an antibody encoded by antibody genes of Set I, Set II, Set III, Set IV,
Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII.

In further embodiments, the invention is directed to aptamers that bind to the
antigen-binding region of an antibody encoded by antibody genes of Set I, Set II, Set III, Set IV,
Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII.

The present invention is also directed to multimeric molecules comprising at least a first
and a second binding site. In these embodiments, the first binding site binds to the
antigen-binding region of an antibody encoded by antibody genes of Set I, Set II, Set III, Set IV,
Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII, and the second binding
site binds to either (a) the antigen-binding region of an antibody encoded by antibody genes of
Set I, Set II, Set III, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or
Set VIII or (b) a B-cell antigen.

The invention is additionally directed to isolated and purified preparations of a
combination of a light chain antibody gene and a heavy chain antibody gene. In these
embodiments, the gene family members of the light chain antibody gene and the heavy chain
antibody gene are present in B cells of two or more patients, and the antibody chains of the B cells
also share the same isotype, JH, D and JL regions, and the B cells are lymphoproliferative in the
patient, or the patient has an autoimmune disease involving the B cells.

In other embodiments, the invention is directed to methods of determining whether a
patient with B cell chronic lymphocytic leukemia (B-CLL) has a form of B-CLL that is
susceptible to treatment directed to eliminating idiotype-specific B cell receptor-bearing B-CLL
cells. The method comprises determining whether the B cell receptors on the patient's B-CLL
cells have an idiotype encoded by antibody genes from Set I, Set II, Set III, Set IV, Set V,
Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII.

In related embodiments, the present invention is directed to methods of following the
progression of treatment of B-CLL in the patient identified by the above-described method as
having a form of B-CLL susceptible to treatment directed to eliminating idiotype-specific B cell
receptor-bearing B-CLL cells. The methods comprise determining whether the B cell receptors
on the B-CLL cells have an idiotype encoded by antibody genes from Set I, Set II, Set III, Set IV,
Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII.

In further embodiments, the invention is directed to methods of treating a patient having
B-CLL, where the B-CLL is caused by B cells comprising antibody genes from Set I, Set II,
Set III, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII. The
methods comprise administering to the patient any of the anti-idiotype antibodies, peptide
antigens, or aptamers described above, or mixtures thereof.

In additional embodiments, the invention is directed to methods of identifying a B-CLL
set. The methods comprise identifying the VH, D, JH, VL, and JL antibody gene families present
on B-CLL cells, where the same antibody gene families are all present in more than one B-CLL
patient.
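
The set-identification idea described above, in which patients whose B-CLL cells carry the same VH, D, JH, VL, and JL gene families are grouped together, can be sketched as a simple grouping step. This is an illustrative sketch only: the `Rearrangement` type, the `find_sets` function, and the patient identifiers are hypothetical, and exact matching of all five fields is a simplifying assumption (several sets defined herein allow alternative JL genes).

```python
from collections import defaultdict
from typing import Dict, List, NamedTuple, Optional


class Rearrangement(NamedTuple):
    """Gene family assignments for one B-CLL clone (None if not available)."""
    vh: str
    d: Optional[str]
    jh: str
    vl: Optional[str]
    jl: Optional[str]


def find_sets(cases: Dict[str, Rearrangement]) -> Dict[Rearrangement, List[str]]:
    """Group patients by identical gene usage; keep groups with more than one patient."""
    groups = defaultdict(list)
    for patient_id, rearrangement in cases.items():
        groups[rearrangement].append(patient_id)
    return {genes: sorted(ids) for genes, ids in groups.items() if len(ids) > 1}


# Hypothetical patients; the gene names follow Sets IV and II as defined above.
cases = {
    "P1": Rearrangement("VH1-69", "D3-16", "JH3", "VκA27", "Jκ1"),
    "P2": Rearrangement("VH1-69", "D3-16", "JH3", "VκA27", "Jκ1"),
    "P3": Rearrangement("VH4-34", "D5-5", "JH6", "VκA17", "Jκ1"),
}
shared = find_sets(cases)  # only the P1/P2 combination occurs in more than one patient
```

A practical implementation would likely also need to compare HCDR3 and LCDR3 sequences, as the figure descriptions below suggest; that step is not shown here.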


Brief Description of the Drawings 

FIG. 1 provides VH, D and JH regions of antibody genes from B-CLL cells of Sets I-VIe. 

FIG. 2 shows amino acid alignments of the H chain V regions of all sequences in Sets II,
IV, V, VIa-e, and VIII. A period indicates homology with the germline gene. Amino acids in
gray are chemically similar to the germline-encoded residues. Underlined positions are known
sites of allelic polymorphism. The consensus sequence for the set is shown at the bottom of each
alignment.

FIG. 3 shows amino acid alignments of the L chain variable regions of all sequences in
Sets II, IV, V, VI, and VIII. See FIG. 2 description above.
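
The alignment convention used in FIGS. 2 and 3, where a period marks a position identical to the germline gene, can be illustrated with a short sketch. The function name `dot_row` and the toy sequences are hypothetical, and the sketch assumes the sequences are already aligned to equal length.

```python
def dot_row(germline: str, sequence: str) -> str:
    """Render one alignment row: '.' where the residue matches germline,
    otherwise the differing residue itself."""
    if len(germline) != len(sequence):
        raise ValueError("sequences must be pre-aligned to equal length")
    return "".join("." if s == g else s for g, s in zip(germline, sequence))


# Toy example: one substitution (R -> K) at the third position.
row = dot_row("CARGG", "CAKGG")  # "..K.."
```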

FIG. 4 shows amino acid and nucleotide sequences of the CDR3 and its junctions of Set
IV. The H chain sequences are shown at left, and the L chain sequences are shown at right. The
most similar germline genes are shown at top. Dots indicate homology with the germline
sequence. Dashes indicate no sequence at that position. The numbering at bottom is for
convenience of reference and is arbitrary. Sequences from the public databases have their
GenBank accession number in parentheses below the case ID. Distinctive junctional residues
exist, including a pair of G codons at the VH-D junction and an N codon at the D-JH junction.
The creation of the G codon at the VH-D junction required trimming of the 3' adenosine
nucleotide at the end of IgVH, along with N addition. Also, limited trimming at the 5' end of the
D segment eliminated the first of the pair of Y codons in all cases. In two instances, D replaced Y
and in two other cases N did the same; both of these are charged residues that fit at the negative
end of the Kyte-Doolittle scale. The Y codon at the 3' end of the D gene was also eliminated in
all sequences of this set. Collectively, these conserved junctional adjustments suggest strong
selection for HCDR3 structure. Three rearranged L chain sequences were available for this set
and all contained the VκA27 gene associated with Jκ1, Jκ4, or Jκ5.

FIG. 5 shows amino acid and nucleotide sequences of the CDR3 and its junctions of Set
VIII. The VH-D junctions are dominated by non-templated Gs. The D-JH junction exhibits
evidence of trimming and fill-in, with an alteration to M where the final D-encoded residue would
be found. This is not a known site of polymorphism, although that explanation cannot be
excluded. Only one L chain sequence was available for this set (GO13), and this consisted of the
VκL6 and Jκ3 genes. There was significant overlap between the germline segments at the VL-JL
junction.

FIG. 6 shows amino acid and nucleotide sequences of the CDR3 and its junctions of Set
V. In these sequences, the 5' end of the germline D gene overlaps the 3' end of the germline IgVH
segment to form the VH-D junction. The presence of several nucleotides that do not match either
germline sequence in the overlap region suggests that trimming and addition occurred, resulting in
a preferred insertion of a residue with a small (A, S, and V) or no (G) side chain. The amino acids
at the D-JH junction are not well conserved. However, the consistent relative positioning of the
VH, D, and JH segments is intriguing because the region of overlap between the VH and D does
not contain significant homology as might be predicted for preferential recombination. This
suggests selection for HCDR3 configuration and D-encoded residues rather than specific
junctional residues. Two rearranged L chain sequences were available from this set (RF22 and
GN12) and both were comprised of Vλ1-16 (1c) and Jλ1 segments. The level of mutation of both
the H and L chains in the members of Sets IV, V, and VIII was always <2%, which is consistent
with published reports of the frequent lack or scarcity of mutations in VH1-69 in B-CLL
(Kipps et al., 1989; Schroeder et al., 1994; Fais et al., 1998).

FIG. 7 shows amino acid and nucleotide sequences of the CDR3 and its junctions of Set
II. The H chain junctions of the sequences in this set of five cases are quite constrained. The
position of the D (D5-5) relative to both VH (VH 4-34) and JH (JH6) segments is identical for
each member, leading to equal HCDR3 lengths. The VH-D and D-JH junctions both contain
evidence of trimming and addition. These processes produced an aromatic residue (W, Y, F) at
the VH-D junction (position 5) followed by a hydrophobic residue (G, P, or A at position 6) and a
pair of codons encoding basic residues (K or R) at the D-JH junction (positions 12 and 13). At
position 9 in the D segment, four out of the five HCDR3 sequences exhibit a P rather than an A
found in the canonical D5-5 segment deposited in the public databases. Although this is most
likely a polymorphism of the D5-5 segment rather than a common mutation, the last of the five
sequences in this set (CLL ID47) also deviates from the canonical D5-5 sequence at this codon,
substituting a D. These highly conserved alterations of the VH-D-JH junctions suggest selection
for a very particular HCDR3 structure. The rearranged L chains of this set are also very similar.
All three available VLJL sequences use VκA17 and either Jκ1 or Jκ2. The junctions are highly
similar with only a single difference that results from an abbreviated recombination that
eliminates the junctional P from CLL240. These cases are of the IgG isotype. Like most IgG+
B-CLL cases that express a switched isotype (Fais et al., 1998; Hashimoto et al., 1992; Ghiotto et
al., 2004), these cases exceed the 2% difference from germline, albeit slightly, and are thus
classified as mutated.

FIG. 8 shows amino acid and nucleotide sequences of the CDR3 and its junctions of Set
VI. The VH1-02 germline sequence is shown. There are no sequence differences between VH1-02
and VH1-03, 1-18, 1-46, or 5-51 for the displayed region. The Jκ1 gene is shown, and
homology between CLL011 and CLL-412 and Jκ2 at positions where the germline sequences of
Jκ2 and Jκ1 differ is indicated with an asterisk. This set is composed of five subsets, totaling
22 patients, that share HCDR3 and VLJL characteristics but incorporate different IgVH genes
(1-02, 1-03, 1-18, 1-46, and 5-51). Each of these genes belongs to the same VH clan (Kirkham et al.,
1992). The HCDR3s of these subsets all share a precise VH-D overlap. Curiously, the D6-19
segment was used in a nonproductive reading frame. However, the stop codon was in the region
of overlap with the terminal IgVH sequence and was trimmed, thereby allowing productive
rearrangements with the JH4 segment. The D-JH junctions contain evidence for trimming and
addition. The first non-germline-templated codon after the D segment is enriched in redundant L
codons, but the remaining junctional codons are not tightly conserved. All the rearranged L
chains available for this set use the Vκ O12/2 gene with use restricted to Jκ1 and Jκ2. Of these
10 sequences, 9 are essentially identical to that of the germline in the LCDR3 and junctional
regions. Thus, this set is unified not only by its common HCDR3 structure and motifs but also by
the use of a virtually identical VLJL partner with a very restricted LCDR3 composition.






Detailed Description of the Invention 

The present invention is based on the discovery that a significant proportion of B-CLL
patients having genetic and protein markers consistent with an aggressive form of the disease, or a
manifestly aggressive form of the disease regardless of said markers, have B-CLL cells with B
cell receptors encoded by antibody gene family members that other B-CLL patients also have.
The inventors have identified at least 10 sets of patients (see Table 1 in the Example), where the
patients within each set have the same B-CLL B cell receptor antibody genes. This accounts for
approximately 10% of B-CLL patients, and about 20% of those patients that have genetic and
protein markers consistent with an aggressive form of the disease. See the Example for details
relating to the discovery of these sets.

As is known, aggressive forms of B-CLL are correlated with B cells that have relatively
few IgV gene mutations and have intracellular expression of ZAP-70, and cell surface expression
of CD38 and CD23. These markers are evaluated at first diagnosis to predict which patients will
have an aggressive form of the disease, in order to determine a course of treatment. Because the
B-CLL cells from patients belonging to identified "sets" with common B cell receptor genes have
low or absent IgV mutations (see Table 1 in Example), it is predicted that patients having B-CLL
cells from each of these sets will have an aggressive form of the disease.

The Figures provide relevant sequences of the B cell receptor antibodies and antibody
genes of B-CLL cells of several patients in the sets. Notable is the relatively small amount of
variation within each set in the number of nucleotides added during the VH-D-JH and VL-JL
recombinations.

While two of these sets (Sets I and III) have been previously identified, it was believed
that those two sets were anomalous and were not expected to account for more than a small
fraction of B-CLL cases. Thus, the discovery, disclosed herein, of multiple other sets that account
for a significant proportion of patients with B-CLL, in particular the apparently aggressive form
of the disease, makes practical the use of various methods and compositions for diagnosis and
treatment of B-CLL, based on the sets identified.

Thus, in some embodiments, the present invention is directed to isolated and purified
preparations of a combination of a light chain antibody gene and a heavy chain antibody gene.
The family members of the light chain antibody gene and the heavy chain antibody gene of these
preparations make up any one of the following sets:
VH4-39/D6-13/JH5/VLκO12/2/JLκ1/κ2 (Set I),
VH4-34/D5-5/JH6/VLκA17/JLκ1/κ2 (Set II),
VH3-21/JH6/VLλ3h/JLλ3 (Set III),
VH1-69/D3-16/JH3/VLκA27/JLκ1/κ4 (Set IV),
VH1-69/D3-10/JH6/VLλ1c/JLλ1 (Set V),
VH1-02/D6-19/JH4/VLκO12/2/JLκ1/κ2 (Set VIa),
VH1-03/D6-19/JH4/VLκO12/2/JLκ1/κ2 (Set VIb),
VH1-18/D6-19/JH4/VLκO12/2/JLκ1 (Set VIc),
VH1-46/D6-19/JH4 (Set VId),
VH5-51/D6-19/JH4/VLκO12/2/JLκ2 (Set VIe),
VH1-69/D3-3/JH4/VLκA19/JLκ4 (Set VII), and
VH1-69/D2-2/JH6/VLκL6/2/JLκ3 (Set VIII).
In some preferred embodiments, the family members of the light
chain antibody gene and the heavy chain antibody gene are selected from the group consisting of
Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, and Set VIII; in other
preferred embodiments, the family members of the light chain antibody gene and the heavy chain
antibody gene are selected from the group consisting of Set II, Set IV, Set V, Set VIa, Set VIb,
Set VIc, Set VId, Set VIe, and Set VII. In additional preferred embodiments, the family members of
the light chain antibody gene and the heavy chain antibody gene are selected from the group
consisting of Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and Set VIII. In
still other preferred embodiments, the family members of the light chain antibody gene and the
heavy chain antibody gene are selected from the group consisting of Set I, Set II, Set III, Set IV,
Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and Set VII.

These preparations, comprising the antibody genes of each of the 12 identified sets, are
useful for preparing reagents for diagnosis and treatment methods described below. Such useful
reagents include compounds that specifically bind to the antigen binding site of the antibodies
encoded by these genes, as further described below.

The antibody genes in these sets can be identified without undue experimentation by
known methods, e.g., as described in the Example, using routine sequencing methods. The
antibody genes are categorized herein as from a particular germline gene even if the antibody
gene has several mutations.
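
Categorizing a sequenced antibody gene by its most similar germline gene, even when it carries a few mutations, can be sketched as a nearest-match search. This is a minimal illustration and not the procedure of the Example: the function name, the toy gene names, and the toy sequences are hypothetical, and the sketch assumes all sequences are pre-aligned to equal length.

```python
from typing import Dict, Tuple


def closest_germline(query: str, germlines: Dict[str, str]) -> Tuple[str, int]:
    """Return (gene name, mismatch count) for the germline sequence that
    differs from the query at the fewest aligned positions.

    Ties are resolved in favor of the alphabetically first gene name.
    """
    if not germlines:
        raise ValueError("at least one germline sequence is required")
    best_name, best_distance = "", -1
    for name, sequence in sorted(germlines.items()):
        if len(sequence) != len(query):
            raise ValueError("sequences must be pre-aligned to equal length")
        distance = sum(1 for a, b in zip(query, sequence) if a != b)
        if best_distance < 0 or distance < best_distance:
            best_name, best_distance = name, distance
    return best_name, best_distance


# Hypothetical germline fragments; the query differs from geneA at one position.
toy_germlines = {"geneA": "ACGTACGT", "geneB": "ACGTTTTT"}
assignment = closest_germline("ACGTACGA", toy_germlines)
```

In this toy case the query is assigned to `geneA` with one mismatch, so it would still be categorized as derived from that germline gene.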

The combination of antibody genes can be in any form, including single chain genes, as
are known in the art. Preferably, the antibody genes are on a vector or vectors, such as a plasmid
or viral vector, in order to facilitate their maintenance, as with a cloning vector, and to be able to
produce the antibodies encoded by the genes, as with an expression vector. Cells in culture
comprising a vector comprising antibody genes from Set I, Set II, Set III, Set IV, Set V, Set VIa,
Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII are also envisioned. Preferably, the
antibody genes are selected from the group consisting of Set II, Set IV, Set V, Set VIa, Set VIb,
Set VIc, Set VId, Set VIe, Set VII, and Set VIII; or Set II, Set IV, Set V, Set VIa, Set VIb,
Set VIc, Set VId, Set VIe, and Set VII; or Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc,
Set VId, Set VIe, and Set VIII; or Set I, Set II, Set III, Set IV, Set V, Set VIa, Set VIb, Set VIc,
Set VId, Set VIe, and Set VII.

In other embodiments, the invention is directed to isolated and purified antibodies
encoded by antibody genes from one of Set I, Set II, Set III, Set IV, Set V, Set VIa, Set VIb,
Set VIc, Set VId, Set VIe, Set VII, or Set VIII. Preferably, the antibody genes are selected from the
group consisting of Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, and
Set VIII; or Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and Set VII; or
Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and Set VIII; or Set I, Set II,
Set III, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and Set VII. As previously
discussed, these antibodies, which are expressed as the B cell receptor on the B-CLL cells from
individuals in the identified sets, can be used to identify reagents that bind to the antibody's
antigen binding site. These antibodies can be produced by any known method. Non-limiting
examples include antibodies from a hybridoma made from the CLL cells and antibodies from cloned
antibody genes. As used herein, the antibodies can be in any form that includes at least one
antigen binding region. The term "antibody" thus includes an Fab, Fab2, or Fv fragment. The
present invention also includes hybridomas that produce the above antibodies.

As is known in the art, a consensus sequence for each set can be identified that provides
the amino acid sequence that is most similar to the sequence of the antibodies of all members of
the set. This consensus sequence can be used to identify an antibody binding site that is most
similar to all the members of the set, in order to most efficiently produce a binding partner (e.g.,
an anti-idiotype antibody) that binds to all members of the set. Thus, the invention is also
directed to these amino acid consensus sequences and to nucleotide sequences encoding the
consensus sequences.
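
One common way to derive such a consensus is a per-position majority vote over aligned sequences, sketched below. The text does not specify how the consensus is computed, so the majority rule, the alphabetical tie-break, and the function name `consensus` are all assumptions made for illustration; the input sequences are toy data.

```python
from collections import Counter
from typing import Sequence


def consensus(aligned: Sequence[str]) -> str:
    """Return the per-column most frequent residue of pre-aligned sequences.

    Assumption: ties are broken by taking the alphabetically first residue,
    purely so that the result is deterministic.
    """
    if not aligned:
        raise ValueError("at least one sequence is required")
    length = len(aligned[0])
    if any(len(sequence) != length for sequence in aligned):
        raise ValueError("sequences must be pre-aligned to equal length")
    residues = []
    for column in zip(*aligned):
        counts = Counter(column)
        highest = max(counts.values())
        residues.append(min(r for r, n in counts.items() if n == highest))
    return "".join(residues)


# Toy aligned CDR3-like fragments from three hypothetical set members.
set_consensus = consensus(["CARGG", "CAKGG", "CARGA"])
```

Here the third column is decided by two R residues against one K, and the last column by two G residues against one A.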

The invention is also directed to anti-idiotype antibodies that bind to the antigen-binding
region of an antibody encoded by the antibody genes of Set I, Set II, Set III, Set IV, Set V,
Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII. Preferably, the antibody genes are
selected from the group consisting of Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId,
Set VIe, Set VII, and Set VIII; or Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId,
Set VIe, and Set VII; or Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and
Set VIII; or Set I, Set II, Set III, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and
Set VII. Since these anti-idiotype antibodies bind to the antibody binding site of the antibodies
that are the B cell receptor of the B-CLL cells from a significant portion of B-CLL patients with
the aggressive form of the disease, the anti-idiotype antibodies can be used in various diagnostic
and treatment methods for B-CLL.

The anti-idiotype antibodies of these embodiments can be made by standard methods,
e.g., screening a phage display library, or producing a hybridoma making monoclonal antibodies
against the antigen binding site of the antibodies encoded by the various B-CLL gene sets
described above. As such, these anti-idiotype antibodies can be from any vertebrate species but
are preferably mouse antibodies, human antibodies, or humanized antibodies. Such antibodies
can be made by known methods without undue experimentation. The present invention also
includes hybridomas that produce the above anti-idiotype antibodies.

In related embodiments, the invention is directed to bispecific antibodies comprising the
binding site of any of the above-described anti-idiotype antibodies and a binding site that binds to
another B cell antigen. The B cell antigen can be any antigen on the B cell, such as a
signal-transducing antigen (either surface or intracellular), or a surface antigen. It is expected
that, in many cases, the bispecific antibodies having a binding site to a B cell surface antigen
would bind to the B cell more tightly than an antibody with two anti-idiotype binding domains,
since anti-idiotype antibodies can be of low avidity. The bispecific antibodies having a binding
site to a signal-transducing antigen would be expected to expedite the signaling pathway, such as a
terminal differentiation pathway or an apoptotic pathway, thus expediting the elimination of a B
cell contributing to the B-CLL disease.

The above anti-idiotype antibodies can also be combined in a mixture that provides
antibodies directed to the binding sites from more than one set. This mixture can include as many
anti-idiotype antibodies as desired, including those directed to any combination, or all, of the
sets. The latter mixture would be effective in diagnosis or treatment methods for all of the sets,
rather than just one set.

When used for treatment methods, the above-described anti-idiotype antibodies or
mixtures thereof would be in a pharmaceutically acceptable excipient.

The above-described anti-idiotype antibody compositions can be formulated without
undue experimentation for administration to a mammal, including humans, as appropriate for the
particular application. Additionally, proper dosages of the compositions can be determined
without undue experimentation using standard dose-response protocols.

Accordingly, the compositions designed for oral, lingual, sublingual, buccal and
intrabuccal administration can be made without undue experimentation by means well known in
the art, for example with an inert diluent or with an edible carrier. The compositions may be
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic
administration, the pharmaceutical compositions of the present invention may be incorporated
with excipients and used in the form of tablets, troches, capsules, elixirs, suspensions, syrups,
wafers, chewing gums and the like.

Tablets, pills, capsules, troches and the like may also contain binders, excipients,
disintegrating agents, lubricants, sweetening agents, and flavoring agents. Some examples of
binders include microcrystalline cellulose, gum tragacanth or gelatin. Examples of excipients
include starch or lactose. Some examples of disintegrating agents include alginic acid, corn starch
and the like. Examples of lubricants include magnesium stearate or potassium stearate. An
example of a glidant is colloidal silicon dioxide. Some examples of sweetening agents include
sucrose, saccharin and the like. Examples of flavoring agents include peppermint, methyl
salicylate, orange flavoring and the like. Materials used in preparing these various compositions
should be pharmaceutically pure and nontoxic in the amounts used.

In preferred embodiments, the anti-idiotype antibody compositions of the present
invention can easily be administered parenterally, such as, for example, by intramuscular,
intrathecal, subcutaneous, intraperitoneal, or, in the most preferred embodiments, intravenous
injection. Parenteral administration can be accomplished by incorporating the compositions of
the present invention into a solution or suspension. Such solutions or suspensions may also
include sterile diluents such as water for injection, saline solution, fixed oils, polyethylene
glycols, glycerine, propylene glycol or other synthetic solvents. Parenteral formulations may also
include antibacterial agents such as, for example, benzyl alcohol or methyl parabens, antioxidants
such as, for example, ascorbic acid or sodium bisulfite, and chelating agents such as EDTA.
Buffers such as acetates, citrates or phosphates and agents for the adjustment of tonicity such as
sodium chloride or dextrose may also be added. The parenteral preparation can be enclosed in
ampules, disposable syringes or multiple dose vials made of glass or plastic.

Rectal administration includes administering the pharmaceutical compositions into the
rectum or large intestine. This can be accomplished using suppositories or enemas. Suppository
formulations can easily be made by methods known in the art. For example, suppository
formulations can be prepared by heating glycerin to about 120° C, dissolving the composition in
the glycerin, mixing the heated glycerin after which purified water may be added, and pouring the
hot mixture into a suppository mold.

Transdermal administration includes percutaneous absorption of the anti-idiotype
antibody composition through the skin. Transdermal formulations include patches (such as the
well-known nicotine patch), ointments, creams, gels, salves and the like.

The present invention includes nasally administering to the mammal a therapeutically
effective amount of the composition. As used herein, nasally administering or nasal
administration includes administering the composition to the mucous membranes of the nasal
passage or nasal cavity of the patient. As used herein, pharmaceutical compositions for nasal
administration of a composition include therapeutically effective amounts of the composition
prepared by well-known methods to be administered, for example, as a nasal spray, nasal drop,
suspension, gel, ointment, cream or powder. Administration of the anti-idiotype antibody
composition may also take place using a nasal tampon or nasal sponge.

In other embodiments, the invention is directed to peptide antigens that bind to the
antigen-binding region of an antibody encoded by antibody genes from Set I, Set II, Set III, Set
IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII. Preferably, the
antibody genes are selected from the group consisting of Set II, Set IV, Set V, Set VIa, Set VIb,
Set VIc, Set VId, Set VIe, Set VII, and Set VIII, or Set II, Set IV, Set V, Set VIa, Set VIb, Set
VIc, Set VId, Set VIe, and Set VII, or Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set
VIe, and Set VIII, or Set I, Set II, Set III, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set
VIe, and Set VII. Such peptide antigens can be made by well-known methods, e.g., phage display
library or high-density peptide library, without undue experimentation.

As used herein, the term "peptide antigen" includes peptide mimetics, also known as
peptidomimetics, which retain the same binding abilities as the analogous amino acid peptide.
Peptide mimetics are peptides composed of amino acid analogs, such as D-amino acids, that are
more resistant to protease degradation than their L-amino acid peptide counterparts. Various
peptide mimetics are known in the art, and any peptide mimetic can be produced without undue
experimentation.

As with the anti-idiotype antibodies, these peptide antigens can be prepared
as a mixture, in order to provide a diagnostic or therapeutic reagent useful for several, or all, of the
B-CLL sets. Also as with the anti-idiotype antibodies, the peptide antigens can be usefully
provided in a pharmaceutically acceptable excipient, for therapeutic applications, preferably for
parenteral administration.

In further embodiments, the invention is directed to aptamers that bind to the
antigen-binding region of an antibody encoded by antibody genes from Set I, Set II, Set III, Set IV, Set V,
Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII. Preferably, the antibody genes
are selected from the group consisting of Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId,
Set VIe, Set VII, and Set VIII, or Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set
VIe, and Set VII, or Set II, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and Set
VIII, or Set I, Set II, Set III, Set IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and Set
VII. As is known, aptamers are single stranded oligonucleotides or oligonucleotide analogs that
bind to a particular target molecule, in this case an antibody binding site. Thus, aptamers are the
oligonucleotide analogy to antibodies. However, aptamers are smaller than antibodies, generally
in the range of 50-100 nt. Their binding is highly dependent on the secondary structure formed by
the aptamer oligonucleotide. Both RNA and single stranded DNA (or analog) aptamers are
known. Thus, these aptamers are analogous to the anti-idiotype antibodies and the peptide
antigens previously discussed. As such, they can also be provided as a mixture of two or more, in
order to have a reagent that can be utilized with more than one set of patients. They can also be
provided in a pharmaceutically acceptable excipient, for therapeutic purposes, preferably for
parenteral administration.

In some embodiments, the anti-idiotype antibody, peptide antigen, aptamer, or mixtures
of these as previously described can usefully be functionalized or derivatized. One useful
derivatization includes a cellular toxin. Such reagents are useful in a "magic bullet" approach to
B-CLL therapy, where the toxin would be expected to kill only the B-CLL cell that the
anti-idiotype antibody, peptide antigen, or aptamer bound. Several cellular toxins known in the art
can be used in these embodiments, including radioactive moieties, ricin, and
chemotherapeutic agents.

In other embodiments, the anti-idiotype antibody, peptide antigen, aptamer, or mixtures of
these as previously described can usefully be further functionalized to comprise a detectable
moiety, such as a fluorophore, or an enzyme that can be treated with a substrate to produce a
colored reaction product. Non-limiting examples of the latter enzyme are horseradish peroxidase
and alkaline phosphatase. Such labeled anti-idiotype antibody, peptide antigen, aptamer, or
mixtures can be used for diagnostic purposes, for example in labeling the B-CLL cells for
fluorescence activated cell sorter analysis or for histological observation of the cells. These
methods are more fully described below.

In additional embodiments, the invention is directed to multimeric molecules comprising
at least a first and a second binding site, the first binding site binding to the antigen-binding
region of an antibody encoded by antibody genes from Set I, Set II, Set III, Set IV, Set V, Set VIa,
Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII, and the second binding site binding to
either (a) the same antigen-binding region of an antibody as the first binding site or (b) another
B-cell antigen. Preferably, the antibody genes are selected from the group consisting of Set II, Set
IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, and Set VIII, or Set II, Set IV, Set
V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, and Set VII, or Set II, Set IV, Set V, Set VIa, Set
VIb, Set VIc, Set VId, Set VIe, and Set VIII, or Set I, Set II, Set III, Set IV, Set V, Set VIa, Set
VIb, Set VIc, Set VId, Set VIe, and Set VII. By providing multiple binding sites to a particular
set, these multimeric compositions would be expected to bind more effectively than the single
binding site peptide antigens or aptamers, or the double binding site anti-idiotype antibodies, as
described above. In preferred embodiments, the multimeric molecules of these embodiments
comprise more than five binding sites. These multimeric molecules can be made by the skilled
artisan without undue experimentation.

In some embodiments, all of the binding sites of the multimeric molecule bind to the
antigen-binding region of an antibody encoded by antibody genes from Set I, Set II, Set III, Set
IV, Set V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII. These binding sites
can be directed to one epitope, to more than one epitope of the antigen-binding region, or to
antigen-binding regions of more than one set.

In these multimeric molecules, the binding sites can be all antibody binding sites, all 
peptide binding sites, all aptamer binding sites, or combinations thereof. 

More generally, the invention is further directed to an isolated and purified preparation of
a combination of a light chain antibody gene and a heavy chain antibody gene, where the gene
family members of the light chain antibody gene and the heavy chain antibody gene are present in
B cells of two or more patients, where the antibody chains of the B cells also share the same
isotype, JH, D and JL regions, and where the B cells are lymphoproliferative in the patient, or
where the patient has an autoimmune disease involving the B cells.

The discovery that B-CLL patients can be classified into sets having common antibody
chains raises the possibility that other lymphoproliferative or autoimmune diseases involving B
cells can also be classified into sets, where each set of patients share B cells that are involved in
the disease with the same antibody genes. The instant disclosure provides evidence for this, since
a patient in Set I has an immunocytoma, a patient in Set II has a small cell lymphocytic lymphoma
(SLL), and a patient in Set VIa has a marginal zone lymphoma (SMZL) (FIG. 1). It is also highly
probable that other B-CLL sets exist.

Preferred lymphoproliferative disorders within these embodiments include Hodgkin's
disease, non-Hodgkin's lymphoma, Burkitt's lymphoma, myeloma, a monoclonal gammopathy
with antibody-mediated neurologic impairment, a monoclonal gammopathy of unknown
significance, and a monoclonal lymphocytosis of undetermined significance. Preferred
autoimmune diseases within these embodiments include systemic lupus erythematosus,
myasthenia gravis, Graves' disease, type I diabetes mellitus, autoimmune peripheral neuropathy,
and autoimmune hemolytic anemia.

As previously discussed, the above compositions are useful for various diagnostic and
therapeutic methods that are envisioned as part of the invention.

Thus, in some embodiments, the invention is directed to methods of

(a) determining whether a patient with B cell chronic lymphocytic leukemia (B-CLL) has
a form of B-CLL susceptible to treatment directed to eliminating idiotype-specific B cell
receptor-bearing B-CLL cells, or

(b) following the progression of treatment of B-CLL in a patient having a form of B-CLL
susceptible to treatment directed to eliminating idiotype-specific B cell receptor-bearing B-CLL
cells.

In these embodiments, the methods comprise determining whether the B cell receptors on
the B-CLL cells have an idiotype encoded by antibody genes from Set I, Set II, Set III, Set IV, Set
V, Set VIa, Set VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII. A determination that the B
cell receptors have the specified idiotype at once establishes that the patient apparently has an
aggressive form of B-CLL, and that the B-CLL can be treated using the anti-idiotype, peptide,
aptamer, mixtures, or multimeric molecules described above, particularly those conjugated to a
cellular toxin. Additionally, by continual monitoring of the idiotype of the B cells from the
patient, one can follow the progress of treatment, since an effective treatment would exhibit a
decreasing amount of B cells having an idiotype from the B-CLL set. No B cells having an
idiotype from the B-CLL set essentially means that the patient is in remission or cured of the
B-CLL.

It can be seen, then, that it is useful to monitor progression of the treatment by
quantifying the B cells having an idiotype from the B-CLL set, since a decreasing quantity of the
B cells indicates an effective treatment, while an increasing quantity of the B cells indicates an
ineffective treatment.
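
The monitoring logic just described can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `treatment_trend`, the representation of the measurements as a sequence of successive cell counts, and the simple first-versus-last comparison are hypothetical and are not part of the disclosed methods.

```python
from typing import Sequence


def treatment_trend(idiotype_cell_counts: Sequence[float]) -> str:
    """Summarize successive counts of B cells bearing the Set idiotype.

    Hypothetical helper: only the first and last measurements are compared.
    """
    if len(idiotype_cell_counts) < 2:
        raise ValueError("need at least two measurements")
    first, last = idiotype_cell_counts[0], idiotype_cell_counts[-1]
    if last == 0:
        # No idiotype-bearing B cells remain in the latest sample.
        return "none detected: consistent with remission"
    if last < first:
        return "decreasing: consistent with effective treatment"
    if last > first:
        return "increasing: consistent with ineffective treatment"
    return "unchanged"
```

In practice the counts would come from one of the determination steps described in this specification, for example cell sorter analysis with a labeled binding agent.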

In these methods the determination step can be by any means known in the art.
Nonlimiting examples include (a) amplification of idiotype-determining regions of the antibody
genes or mRNA, e.g., by polymerase chain reaction, and evaluating whether the amplified regions
are amplified from the B-CLL set in question; (b) sequencing the amplified regions; (c) evaluating
whether the amplified regions hybridize with equivalent regions from the B-CLL set in question;
(d) evaluating whether the patient has circulating antibodies with an idiotype encoded by the
antibody genes from the B-CLL set in question; (e) evaluating whether the patient has antibodies
that bind to a binding agent (e.g., an anti-idiotype antibody, a peptide antigen, or an aptamer, as
described above, preferably comprising a detectable moiety) specific for the idiotype encoded by
the antibody genes from the set in question; or (f) mixing a labeled anti-idiotype antibody, peptide
antigen, or aptamer with lymphocytes of the patient and determining whether lymphocytes that
bind to the composition are present, e.g., using a Coulter counter or a cell sorter.

The above methods can be used with a B-CLL patient at any stage of the disease,
including in a pre-leukemic, early leukemic, or frank leukemic state. Furthermore, the B-CLL cells
can be obtained from the blood, the bone marrow, the spleen, and/or the lymph nodes, depending
on the results of initial diagnosis and the stage of the disease.

The present invention is also directed to methods of treating a patient having B-CLL
caused by B cells comprising antibody genes from Set I, Set II, Set III, Set IV, Set V, Set VIa, Set
VIb, Set VIc, Set VId, Set VIe, Set VII, or Set VIII. The methods comprise administering to the
patient the anti-idiotype antibody, peptide antigen, aptamer, or mixture
described above, in a pharmaceutically acceptable excipient.

Although the anti-idiotype antibody, peptide antigen, aptamer, or mixture by themselves
could be effective in eliminating the B cells, because they could set off an apoptotic cascade in the
cells, it is preferred that the anti-idiotype antibody, peptide antigen, aptamer, or mixture also
comprise a cellular toxin, as described above, that can directly kill the cell.

Additionally, the invention is directed to methods of identifying other B-CLL sets. The
methods comprise identifying the VH, D, JH, VL, and JL classes of antibody genes present on
B-CLL cells, where the same classes are all present in more than one B-CLL patient. It is

Introduction.

The B-lymphocyte clone expanded in chronic lymphocytic leukemia (B-CLL) expresses
low levels of surface membrane Ig, the B cell antigen receptor (BCR). The genetics of this Ig
have clinical relevance, as patients with a clone whose Ig variable (V) region has no or few
mutations have a significantly worse outcome than those with significant numbers of Ig V
mutations (Damle et al., 1999; Hamblin et al., 1999). The biology underlying this association is
unclear.

Several lines of evidence support a role for the BCR in the evolution of B-CLL (reviewed
in Chiorazzi and Ferrarini, 2003). The distribution of individual IgVH in B-CLL clones differs
from that found in normal cells (Fais et al., 1998), with an increased frequency of VH1-69, VH4-34,
and VH3-07 (Fais et al., 1998; Schroeder and Dighiero, 1994; Johnson et al., 1997). In addition,
the distribution of mutations among B-CLL cases using these specific VH genes is selectively
biased (Fais et al., 1998; Schroeder and Dighiero, 1994; Kipps et al., 1989).

Recently, two subgroups of B-CLL cases with remarkable similarity of the entire BCR (V
regions of the H and L chain) were identified (Tobin et al., 2003; Ghiotto et al., 2004). Although
these findings are provocative, they have been considered rare and potentially anomalous, since
in one instance the clones expressed IgG (Ghiotto et al., 2004) and in the other geography and
ethnicity may be relevant (Tobin et al., 2002). This report describes another eight groups of
B-CLL patients that express BCRs of strikingly similar primary structure defined by highly similar
Ig V regions in the H and L chains and, in particular, distinct H and L CDR3 configurations.
Thus, a significant fraction of B-CLL clones derive from B-lymphocytes with constrained antigen
binding sites that could recognize individual, discrete antigen(s) or classes of structurally similar
epitopes.

Materials and Methods 

IgV gene sequencing. VHDJH and VLJL sequences were determined by previously
described methods (Fais et al., 1998; Ghiotto et al., 2004).

Database Searches. B-CLL Ig H chain V amino acid sequences from our collection
(n=255) and the public databases (n=197) were subjected to BLAST searches of both nucleotide
and protein databases to identify similar sequences. The criteria used to define "Sets" of similar
rearranged VHDJH were: A) use of the same VH, D, and JH germline genes, B) use of the same D
segment reading frame and position relative to the VH, plus or minus one codon, and C) an amino
acid similarity within the HCDR3 of >60% identity. In addition, all B-CLL Ig H protein
sequences were aligned and clustered using the ClustalW alignment algorithm. Sequences
clustering tightly were visually inspected for similarity. All of these searches used the complete
VHDJH and as such were weighted toward sequences that used the same VH gene. To identify
sequences with similar HCDR3 but different VH genes, CDR3 motifs from the various sets were
used to search the public databases with the ProteinInfo search engine
(http://prowl.rockefeller.edu/). The criteria for the members of Set VI were altered to permit the
use of different IgVH genes that were members of the same IgVH clan, while retaining the criteria
for the rearranged VLJL. Use of the same specific IgVL gene and >85% LCDR3 identity was
required for the inclusion of a companion rearranged VLJL in a Set.
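
The Set criteria listed above can be expressed as a short sketch. It is offered only as an illustration: the class `HeavyRearrangement`, the function names, and in particular the ungapped, position-by-position identity measure are assumptions introduced here, not the procedure actually used (which relied on BLAST, ClustalW, and visual inspection). The relaxed clan-level VH rule applied to one Set is not modeled, and the CDR3 strings used with it would be supplied by the user.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HeavyRearrangement:
    """Hypothetical record of one rearranged VHDJH."""
    vh: str               # germline VH gene, e.g. "1-69"
    d: str                # germline D segment, e.g. "3-10"
    jh: str               # germline JH gene, e.g. "6"
    d_reading_frame: int  # reading frame in which D is read
    d_position: int       # D position relative to VH, in codons
    hcdr3: str            # HCDR3 amino acid sequence


def percent_identity(a: str, b: str) -> float:
    """Percent identical residues (assumed simplification: ungapped
    comparison, normalized by the longer sequence)."""
    if not a or not b:
        return 0.0
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / max(len(a), len(b))


def same_heavy_set(x: HeavyRearrangement, y: HeavyRearrangement,
                   min_identity: float = 60.0) -> bool:
    """Criteria A to C: same VH/D/JH, same D reading frame and position
    (plus or minus one codon), and HCDR3 identity above the threshold."""
    return (
        (x.vh, x.d, x.jh) == (y.vh, y.d, y.jh)
        and x.d_reading_frame == y.d_reading_frame
        and abs(x.d_position - y.d_position) <= 1
        and percent_identity(x.hcdr3, y.hcdr3) > min_identity
    )


def light_chain_matches(vl_x: str, lcdr3_x: str, vl_y: str, lcdr3_y: str,
                        min_identity: float = 85.0) -> bool:
    """Companion VLJL rule: same IgVL gene and LCDR3 identity above 85%."""
    return vl_x == vl_y and percent_identity(lcdr3_x, lcdr3_y) > min_identity
```

A pair of rearrangements would be grouped only when `same_heavy_set` holds and, where light chains are available, `light_chain_matches` holds as well.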

538 sequences from CD5+ and CD5- peripheral B-lymphocytes (Tobin et al., 2002;
Geiger et al., 2000) were downloaded from the public database. These 538 sequences were
independently compared to the translated databases using tblastn on the BlastMachine at the
AMDeC Bioinformatics Core Facility at the Columbia Genome Center, Columbia University.

Detailed nucleotide and amino acid sequence alignments of the junctional regions and 
complete protein sequence alignments of the sequences described here are provided in the 
Figures. 

Results and Discussion 

Identification of subgroups of B-CLL patients with highly restricted VHDJH segments and
shared HCDR3 configurations. Each B-CLL-derived VHDJH sequence in our database was
compared with every B-CLL sequence in our collection (n = 255) as well as with those in the
public Ig V gene databases (n = 197) using nucleotide and protein sequence BLAST. In addition,
all available B-CLL H chain V region sequences were phylogenetically grouped using the
ClustalW method; sequences that clustered together were further analyzed for HCDR3 sequence
similarity. These screening methods identified Sets of sequences (Table 1) consisting of the same
IgVH with highly similar HCDR3 resulting from identical D (when identifiable) and JH segment
use, D segment reading frame, similar D segment position relative to IgVH, and HCDR3 length,
and significant (>60%) amino acid sequence identity.

Table 1. Sets of B-CLL cases that share Ig V region genes and have a high degree of similarity in
HCDR3.

Set   Int.    Public  Pub.   Isotype  VH    VH mutation %     D     JH  VL             JL
      B-CLL*  B-CLL   other                 max   min
I
II                                    4-34  3.1   2.0   2.7   5-5       κA17 (3/3)     κ1/κ2
III           2       2c     IgM      3-21  2.4   0.0   1.4   ND    6                  λ3
IV                                          0.6   0.0   0.0   3-16  3   κA27 (2/2)
V     0       4       0      IgM      1-69  0.3   0.0   0.3   3-10  6   λ1-16 (1/1)    λ1
VIa   4       2       1e     IgM      1-02  0.3   0.0   0.0   6-19  4   κO12/2 (4/4)   κ1/κ2
VIb   2       4       0      IgM      1-03  2.0   0.3   0.8   6-19  4   κO12/2 (3/3)   κ1/κ2
VIc   1       0       0      IgM      1-18  1.2   1.2   1.2   6-19  4   κO12/2 (1/1)   κ1
VId   0       2       0      IgM      1-46  0.0   0.0   0.0   6-19  4   0/0
VIe           6       0      IgM      5-51  2.7   0.0   0.2   6-19      κO12/2 (1/2)   κ2
VII                          IgM            0.0   0.0   0.0   3-3   4                  κ4
VIII  3       0       0      IgM      1-69              0.0   2-2   6   κL6            κ3

* Internal B-CLL
b immunocytoma, accession Y09249
c small lymphocytic leukemia, accession AF299104, and elderly normal, accession AF174100
d anti-cardiolipin antibody, accession AF460965
e small marginal zone lymphoma, accession AJ487492

Three subsets of Set VI (VIa, VIb, and VIe) contained sequences that utilized different
IgVH genes but used the same D and JH segments, the same Vκ, and had highly similar HCDR3
configurations. Therefore, we used the HCDR3 motif common to these three subsets to search
public databases for additional sequences with the same HCDR3 configuration potentially
associated with a different IgVH segment. This search was not restricted to B-CLL sequences.
The approach confirmed the previously identified subsets and identified two additional subsets of
Set VI (VIc and VId).

The public database searches identified 21 VHDJH sequences, belonging to one of the
eight individual Sets, bringing the total number of sequences among these Sets to 45.
Interestingly, only two of the 21 sequences culled from the public databases were not derived
from B-CLL cells. These two were from an anti-cardiolipin antibody producing B cell (Set IV)
and from a splenic marginal zone lymphoma (Set VIa). This distribution of similar sequences is
particularly striking since, at the time of this search, the public databases contained only 197 Ig H
chain V region sequences from B-CLL patients (excluding those from our laboratories) out of a
total of over 8,500 H chain V region sequences (search of Entrez with terms "human
immunoglobulin heavy chain variable" produced 8,874 hits in the nucleotide database and over
6,183 hits in the protein database on 12/16/03).

Pairing restricted VLJL rearrangements with VHDJH segments in Sets. VLJL sequences
corresponding to the shared VHDJH of the 5 Sets were available for most of our B-CLL cases and
for a few of those identified in the public databases. Remarkably, the available IgVL were highly
conserved within Sets and the corresponding JL were very restricted (Table 1 and FIG. 2). Six of
the eight Sets with available L chains expressed the κ isotype.

IgV gene mutation status and isotype restrictions of individual Sets. Most of the IgVH
sequences in each Set differed by <2.0% from the most similar germline gene, with the exception
of Set II in which the median level of mutation was 3.0%. Notably, the deduced protein
structures in those sequences that were considered "mutated" using the typical 2% threshold
differed from the germline by relatively low levels. Only one sequence, from Set II (CLL ID47,
FIG. 2), differed by more than 5% from its germline counterpart. The corresponding IgVL in each
Set exhibited low levels of mutation; in some cases they displayed <2.0% difference while VH had
>2% difference from the germline sequence (Table 1 and FIG. 2).

The H chain isotype was the same among members of a Set. All Sets expressed IgM,
except for Set IV that consisted of IgG+ cases, similar to a patient group reported previously
(Ghiotto et al., 2004).
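
The mutation-status classification referred to above (percent difference from the most similar germline gene, judged against the typical 2% threshold) can be illustrated with a minimal sketch. The function names are hypothetical, the sequences are assumed to be pre-aligned and of equal length, and treating a difference of exactly 2% as "mutated" is an assumption about the boundary case.

```python
def percent_difference(sequence: str, germline: str) -> float:
    """Percent of positions at which an aligned V gene sequence differs
    from its most similar germline gene (assumes equal-length, pre-aligned
    nucleotide strings)."""
    if not germline or len(sequence) != len(germline):
        raise ValueError("sequences must be non-empty and of equal length")
    mismatches = sum(a != b for a, b in zip(sequence.upper(), germline.upper()))
    return 100.0 * mismatches / len(germline)


def is_mutated(sequence: str, germline: str, threshold: float = 2.0) -> bool:
    """Apply the 2% threshold; the >= comparison is an assumed convention."""
    return percent_difference(sequence, germline) >= threshold
```

Under this convention a sequence with one difference in 100 aligned positions is classed as unmutated, and one with three differences as mutated.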

H and L CDR3 characteristics of the individual Sets. We identified trends in the
chemical, structural, or functional nature of the residues that comprise the H and L CDR3s, and in
particular their VH-D and D-JH junctions. For example, the D segments in the HCDR3s of these
Sets were read in the hydrophobic and stop reading frames more often than in normal (Zemlin et
al., 2003) and B-CLL (Fais et al., 1999) cells. For all cases in Set VI, the D6-19 segment is read
in a non-productive reading frame. However, the germline stop codon, located in the region of
overlap with the terminal IgVH sequence, was trimmed, allowing productive rearrangements with
the segment (FIG. 8).

Also of note was the repeated occurrence of certain non-germline encoded amino acids
within D segments in some of the Sets. For example, in all members of Set VIII, a change to M is
found at the 3' end of the D segment (FIG. 5), a position that is not known to be polymorphic.
Three of 7 sequences in Set V had an R to Q change within the D3-10 segment that is also not
listed as polymorphic (FIG. 6). In 4 of 5 cases in Set II, P replaced A in the portion of HCDR3
encoded by the canonical D5-5 segment. While this is most likely a polymorphism of the D
segment rather than a common mutation, the last of the 5 sequences in this set (CLL ID47) also
deviates from the canonical D5-5 sequence at this codon, substituting a D (FIG. 7). Thus, even if
these amino acid changes represent polymorphisms, their relative consistency within each Set
suggests a selection for these residues.

Members of several Sets have common junctional residues that were not templated by any
known germline gene segments and therefore presumably arose from trimming and/or addition
during recombinational assembly. The sequences in Set IV all contain a pair of Gs at the VH-D
junction and an N at the D-JH junction (FIG. 4). A very similar VH-D junctional finding exists in
Set VIII (FIG. 5). All sequences in Set II contain an aromatic residue at the VH-D and a pair of
basic residues (R or K) at the D-JH junction (FIG. 7).

Other trends in the composition of the H and L CDR3s are found in the other Sets. These
and the fine details of the nucleotide and amino acid sequences of the VHDJH and VLJL junctions
for each Set are shown and discussed in the Supplemental data (see FIGS. 4-8).

Structural similarities of the BCR among members of the Sets. The deduced VHDJH and
VLJL protein sequences for each member of the stereotyped Sets are presented in FIGS. 2 and 3.
Because most members of the Sets use the same IgVH, primarily in an unmutated form, associated
with the same D and JH segments, and since these rearrangements are virtually always paired with
an identical IgVL that is restricted in its linked JL, the primary structural features of the entire BCR
of each Set are likely remarkably similar. Furthermore, the amino acid sequences of HCDR1,
HCDR2, LCDR1, and LCDR2 of members of the individual Sets are extremely similar, if not
identical (e.g., Sets IV, V, VIII, and the Set VI subsets). In Set II, some amino acid differences
exist in these regions due to somatic mutation.

These data indicate a much more marked constraint on the primary structure of the BCR
in B-CLL than previously appreciated. They also indicate that this principle occurs in a sizeable
number of patients. Collectively, ~12% (31 of 255: 22 from this study, 5 from our previous study
(Ghiotto et al., 2004), and 4 that match another described set (Tobin et al., 2002; 2003)) of all of
the sequences in our internal laboratory B-CLL database and ~20% (27 of 131) of those with
unmutated IgV belong to one of the eight stereotyped Sets described here or one of the two patient
groups mentioned above (Tobin et al., 2002; 2003; Ghiotto et al., 2004). Approximately the same
overall frequency (~12%) was encountered among the sequences from the public databases (21 of
197), although the proportion of the public B-CLL sequences that are unmutated was not
determined. Most of the rearrangements in these Sets lack or have few somatic mutations, and
even those whose VH surpass the 2% threshold commonly used as the criterion to define
significant IgV gene mutations (Fais et al., 1998; Schroeder and Dighiero, 1994) are only
slightly above that level. This suggests that restricted BCR structure is primarily a feature of
those patients with the worse clinical course and outcome (Damle et al., 1999; Hamblin et al.,
1999). It appears that 1 of 5 B-CLL cases with unmutated BCRs fit into one of these defined
Sets. Additional Sets will likely be uncovered as more Ig V region sequences are defined in
B-CLL, and all unmutated cases may be similar to one of a discrete number of archetypal Sets.
Although Sets IV, V, VII, and VIII use unmutated 1-69, they differ from previously described
1-69-expressing B-CLL cases that have restrictions in specific D and JH segment associations (Fais
et al., 1998; Johnson et al., 1997). These differences include JH (JhS vs. JH6 in Set I), D (D2 vs.
D3 family and VκL6 with an extremely short LCDR3 in Set VIII), and L chain (λ vs. κ in Set V)
gene use.
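
The proportions quoted in this paragraph can be checked with simple arithmetic. The short sketch below only restates the counts given in the text; the dictionary keys and the helper name `percent` are labels introduced here for illustration.

```python
from fractions import Fraction

# Counts (members of a Set, total sequences) as stated in the text.
counts = {
    "internal B-CLL database": (31, 255),
    "internal, unmutated IgV": (27, 131),
    "public databases": (21, 197),
}

# Breakdown of the 31 internal sequences given in the text.
internal_breakdown = (22, 5, 4)


def percent(numerator: int, denominator: int) -> float:
    """Exact ratio converted to a percentage."""
    return float(Fraction(numerator, denominator) * 100)


for label, (n, d) in counts.items():
    print(f"{label}: {n}/{d} = {percent(n, d):.1f}%")
# internal B-CLL database: 31/255 = 12.2%
# internal, unmutated IgV: 27/131 = 20.6%
# public databases: 21/197 = 10.7%
```

The 27 of 131 figure corresponds to roughly one in five unmutated cases, in line with the statement that about 1 of 5 such cases fit a defined Set.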

Initial studies that considered only IgVH or VHDJH (Fais et al., 1998; Schroeder and
Dighiero, 1994; Johnson et al., 1997; Chiorazzi and Ferrarini, 2001) pointed toward limited
structural diversity in the antigen-binding sites of B-CLL. However, our results are much more
striking because of the remarkable similarity of the sequences within a Set and the virtual
mathematical impossibility that this similarity arose by chance. If gene segment use in B-CLL were
random, the probability of finding the same combination of VHDJH and VLJL segments in
independent leukemic (or normal) B cells would be >1 x 10'^. Therefore, one would not expect to
identify two B-CLL patients with BCRs comprised of the same VHDJH/VLJL until >1 million cases
were analyzed. This calculation is conservative since it does not account for diversity at the
VH-D, D-JH, and VL-JL junctions that can be quite extensive (potentially exceeding 1x10'^ and
reaching 1 x 10"*^), although receptor editing and revision could limit these possibilities
somewhat. Nevertheless, the level and frequency of BCR structural restriction in clusters of
patients reported here is extraordinary and appears to be higher than any other B or T cell
lymphoproliferative disorder reported to date.

Finding similar Ig H chain V region sequences by homology searches of the public
databases is not, in itself, completely surprising because some IgVh are expressed in a biased
fashion and ~6,600 different Vh-D-Jh combinations can occur. Because the databases contain
more than that number of Ig H chain V region sequences, identifying the same recombined gene
segments is not improbable. When we analyzed 538 CD5+ and CD5- B cell-derived H chain V
region sequences, we identified many pairs of similar sequences and some groups of similar
sequences. However, these groups derived from B cells of diverse sources, as would be expected
if the similarities were the product of random chance. In contrast, the similarity to a given
B-CLL-derived sequence detected in our database comparisons arose almost exclusively from other
B-CLL sequences (19/21) or other lymphoproliferative disorders (1/21), even though the entire
database was searched. Only one identified sequence was from a non-B-CLL clone and that
coded an autoantibody (Table I and FIG. 2). Although the proper normal B cell repertoire against
which B-CLL clones should be compared remains an open question (Chiorazzi and Ferrarini, 
2003), these results demonstrate that sequence sets of restricted cellular origin are not a 
generalized phenomenon in the public database. 
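
As a rough illustration of why chance matches are expected in a collection of this size, the following Python sketch estimates how many pairs among 538 sequences would share the same Vh-D-Jh combination if each of the ~6,600 combinations were equally likely. The uniform-usage assumption and the function name are introduced here for illustration only (the text itself notes that some segments are used in a biased fashion), and the calculation is not part of the analysis reported above.

```python
from math import comb


def expected_matching_pairs(n_sequences: int, n_combinations: int) -> float:
    """Expected number of sequence pairs sharing one combination by chance.

    Assumes every combination is equally likely and sequences are independent
    (an illustrative simplification, not a claim about real repertoires).
    """
    # Each of the C(n, 2) possible pairs matches with probability 1/n_combinations.
    return comb(n_sequences, 2) / n_combinations


if __name__ == "__main__":
    # 538 sequences and ~6,600 Vh-D-Jh combinations are the figures in the text.
    print(round(expected_matching_pairs(538, 6600), 1))
```

Under these assumptions the estimate is roughly 22 chance pairs, which is in line with the statement that finding pairs of similar H chain sequences from diverse cell sources is not improbable.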

Therefore, the development of B-CLL must involve B cell clones with restricted IgV
and/or BCR structure. While it seems unlikely that the expression of particular BCR gene
combinations could be the sole promoting factor for leukemogenesis, a strong inherent bias in
gene segment association and VhDJh/VlJl pairing in the B cell population that gives rise to
B-CLL cannot be formally excluded, especially since the cell of origin for B-CLL is still uncertain
(Chiorazzi and Ferrarini, 2003). Although evidence exists in mice for biases in the recombination
of particular Ig V gene segments prior to antigen experience (Seidl et al., 1997), the extent of
restriction imposed by recombination biases at both the H and L chain V gene loci in those
instances, especially at the V-(D)-J junctions, is not as severe as in the Sets described here.
To our knowledge, there is no known subpopulation of human B cells in which the frequency of
similar rearrangements, independent of antigen selection, is as great as among these B-CLL cases.

Therefore, antigen selection probably has a strong restrictive influence on the
transformation of a normal B-lymphocyte to a B-CLL cell. A simple model would postulate that
the transforming event is coupled with antigen specificity, i.e., an individual B-lymphocyte from a
highly diverse population could bind and internalize a transforming agent (e.g., a virus) via its
BCR. Although this seems unlikely, such a mechanism has been implied for B-CLL (Mann et al.,
1987).

Alternatively, antigen could be a promoting factor for transformation, selecting specific 
clones for expansion from an initially diverse population of B-lymphocytes and fostering their
development to and in the transformed state (Chiorazzi and Ferrarini, 2003). This would be the
case if the B-CLL-susceptible cell population were pre-selected for antigen-reactivity, and 
therefore BCR structure, by exposure to distinct antigens or classes of antigens during their 
development. These clones could differ among patients, especially if the selecting antigens were 
foreign or autologous and possibly polymorphic. From within these clonal expansions, one
member could develop an initial transforming lesion that would promulgate the leukemogenic
cascade independent of antigen. 

Finally, the initial transforming events could occur at random within a diverse B cell 
population or a previously antigen-selected population, and the subsequent nurturing of the 
transformed clone to clinical B-CLL could require ongoing BCR engagement by antigen
(Chiorazzi and Ferrarini, 2003). Recently, clonal expansions of B cells with phenotypic
characteristics of B-CLL were found in normal elderly individuals (Rawstron et al., 2002; Ghia et
al., 2004). The clinical relevance of these clones is not established. However, they may represent
clones that have some of the genetic lesions of B-CLL but lack BCR specificities that would 
result in sufficient ongoing stimulus to mature them into clinical B-CLL. 

The remarkable protein similarity of the entire BCR among members of each Set (FIGS. 2
and 3) suggests that they could recognize the same or similar antigens. While the nature of the
antigen(s) cannot be directly deduced from the Ig sequences presented here, there are several
reasons to suspect that they are autoantigens or carbohydrates possibly derived from bacterial or
viral coats, or a combination of the two.

Vh1-69 (Sets I, II, and III) and Vh3-21 (previously described Set in Tobin et al., 2002;
2003) are enriched among rheumatoid factors (Silverman et al., 1988; He et al., 1995). Vh4-34
(Set II) is used in every case of monoclonal cold agglutinin disease (Pascual et al., 1992) and in
autoimmune conditions. Indeed, the inherent autoreactivity of this Vh segment elicits a major
inhibitory process by the immune system that keeps 4-34+ B cells from diversifying into high
affinity, isotype-switched B cells (Pugh-Bernard et al., 2001). The anti-cardiolipin antibody
identified as a member of Set IV implies that the other members of that Set may be specific for
cardiolipin or DNA, since some antibodies to the former react with the latter (Kumar et al., 2003).
In addition, restricted VhDJh and/or VlJl gene segments are features of B cells that produce
anti-carbohydrate mAb in human (Scott et al., 1989) and mouse (Potter, 1977).

Characteristic junctional residues are also a feature of anti-carbohydrate mAb and
autoantibodies, and basic junctional residues, as seen in Sets II, IV, and VIe (FIG. 2), often
indicate reactivity with acidic targets such as DNA (Radic and Weigert, 1994). The synthesis of
autoreactive Ig/BCR molecules by many B-CLL clones (Sthoeger et al., 1989; Borche et al.,
1990) supports a link between the unique BCR structural features of these Sets and
autoantibodies.

The non-B-CLL Ig sequences that matched these B-CLL stereotypes may give insight 
into the identity of the B-CLL progenitor cell(s). One of those two derived from a splenic 
marginal zone lymphoma (SMZL; Set VIa, FIG. 2) and the other from an autoantibody-producing
B cell (Set IV, FIG. 2). Interestingly, normal MZ B cells produce mAb that can recognize
thymus-independent type II antigens and autoantigens (Bendelac et al., 2001). In addition, the Ig
V region repertoire of murine MZ B cells is very restricted in gene segment use and structure that
requires intact BCR signal transduction to develop (Martin and Kearney, 2000). MZ B cells
appear to be progenitors for gastric MALT lymphoma (Isaacson, 1999) and have been proposed
as precursors of B-CLL cells (Chiorazzi and Ferrarini, 2003). If one infers common antigenic
reactivity based on the similar sequences within a Set, a significant fraction of B-CLL cases, and
in particular those with unmutated genes, produce mAb that recognize one of a limited, 
discrete array of antigens or epitopes. With such an interpretation, some B-CLL cases may 
resemble gastric MALT lymphoma regarding the role of antigenic drive (in that instance, H.
pylori) in the promotion of malignancy. The stereotyped Ig molecules reported here might be
valuable probes to identify antigens that drive the leukemogenic process in B-CLL. 

Finally, these Sets of stereotyped Ig molecules may serve as therapeutic targets on B-CLL 
cells. A conceptual drawback to targeting the BCR as a tumor-specific antigen has been the 
apparent need to create an individualized reagent for each patient. However, since our data 
indicate that there is potentially extensive overlap in BCR structure and specificities among 
groups of B-CLL cases, this approach may be far less daunting. Indeed, since ~20% of the cases 
with unmutated IgVh genes fall into one of these Sets, such targeting might be most effective in
those cases that have the worst prognosis, are least responsive to therapy, and have the most 
aggressive clinical courses (Damle et al., 1999; Hamblin et al., 1999). 

In view of the above, it will be seen that the several advantages of the invention are
achieved and other advantages attained. 

As various changes could be made in the above methods and compositions without 
departing from the scope of the invention, it is intended that all matter contained in the above 
description and shown in the accompanying drawings shall be interpreted as illustrative and not in 
a limiting sense. 

All references cited in this specification are hereby incorporated by reference. The 
discussion of the references herein is intended merely to summarize the assertions made by the
authors and no admission is made that any reference constitutes prior art. Applicants reserve the 
right to challenge the accuracy and pertinence of the cited references. 



